GBE 



Evolution of Retinoid and Steroid Signaling: Vertebrate 
Diversification from an Amphioxus Perspective 

Ricard Albalat 1 '*'|, Frederic Brunet 2 't, Vincent Laudet 2 , and Michael Schubert 2 '* 

1 Departament de Genetica, Facultat de Biologia and Institut de Recerca de la Biodiversitat, Universitat de Barcelona, Spain 

2 lnstitut de Genomique Fonctionnelle de Lyon (UCBL, CNRS UMR5242, ENSL, INRA 1288), Ecole Normale Superieure de Lyon, 
France 

^Corresponding author: E-mail: ralbalat@ub.edu. 
|These authors contributed equally to this work. 
Accepted: 2 August 2011 

Abstract 

Although the physiological relevance of retinoids and steroids in vertebrates is very well established, the origin and evolution 
of the genetic machineries implicated in their metabolic pathways is still very poorly understood. We investigated the 
evolution of these genetic networks by conducting an exhaustive survey of components of the retinoid and steroid pathways 
in the genome of the invertebrate chordate amphioxus (Branchiostoma floridae). Due to its phylogenetic position at the base 
of chordates, amphioxus is a very useful model to identify and study chordate versus vertebrate innovations, both on 
a morphological and a genomic level. We have characterized more than 220 amphioxus genes evolutionarily related to 
vertebrate components of the retinoid and steroid pathways and found that, globally, amphioxus has orthologs of most of 
the vertebrate components of these two pathways, with some very important exceptions. For example, we failed to identify 
a vertebrate-like machinery for retinoid storage, transport, and delivery in amphioxus and were also unable to characterize 
components of the adrenal steroid pathway in this invertebrate chordate. The absence of these genes from the amphioxus 
genome suggests that both an elaboration and a refinement of the retinoid and steroid pathways took place at the base of 
the vertebrate lineage. In stark contrast, we also identified massive amplifications in some amphioxus gene families, most 
extensively in the short-chain dehydrogenase/reductase superfamily, which, based on phylogenetic and genomic linkage 
analyses, were likely the result of duplications specific to the amphioxus lineage. In sum, this detailed characterization of 
genes implicated in retinoid and steroid signaling in amphioxus allows us not only to reconstruct an outline of these 
pathways in the ancestral chordate but also to discuss functional innovations in retinoid homeostasis and steroid-dependent 
regulation in both cephalochordate and vertebrate evolution. 

Key words: Branchiostoma floridae, cephalochordates, gene duplication, metabolic networks, nuclear receptor signaling 
cascades. 



Introduction 

Acting as intercellular messengers, retinoids and steroids 
contribute to the coordination of developmental processes 
and cellular functions in vertebrates. At the signaling level, 
both signaling systems rely on members of the nuclear hor- 
mone receptor family to regulate transcription of target 
genes, and, classically, the evolution of the retinoid and ste- 
roid pathways has been inferred by identification and char- 
acterization of nuclear hormone receptors in different 
animal lineages (Mangelsdorf et al. 1995; Escriva et al. 
1997, 2000, 2004; Bertrand et al. 2004; Markov et al. 



2008). In contrast, although retinoids and steroids use re- 
lated machineries and similar strategies to regulate their 
physiological activity, the evolution of these metabolic path- 
ways is still poorly understood. 

In vertebrates, retinoid metabolism, storage and transport 
requires no less than seven enzymatic activities, three binding 
capacities, one cell-surface receptor, and one plasma protein 
(fig. ^A) (for recent reviews on retinoid metabolism, see Alba- 
lat 2009; Theodosiou, et al. 201 0). In brief, retinoid metabo- 
lism begins when dietary p-carotenes, the main source of 
retinoids, are cleaved by p,p-carotene-1 5,1 5'-oxygenase 
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Fig. 1. — Vertebrate retinoid {A) and steroid (B) metabolism. (A) The main components of retinoid metabolism, storage, and transport in 
vertebrates are shown. Proteins are boxed, and retinoids are in black. Proteins of the same family are shown in identical colors. Enzymatic reactions are 
shown with arrows. The dotted arrow indicates retinoid transport and dotted lines highlight retinoid binding to proteins. {B) The main enzymatic 
reactions of steroidogenesis in vertebrates are indicated. Enzymatic reactions are shown with arrows. Proteins are boxed, and steroids are in black. 
Proteins of the same family are shown in identical colors. 
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(Bcmol ; also known as Bco or Bcox), which generates two 
molecules of retinal. This retinal is subsequently reduced to 
retinol (vitamin A) by retinaldehyde reductases, a heteroge- 
neous group of enzymes of diverse protein families (Dalfo 
et al. 2007 and references therein). The retinol is esterified 
to retinyl esters by lecithin-retinol acyltransferase (Lrat) or 
acyl-CoA-retinol acyltransferase (Dgatl ) enzymes and stored 
in the liver. Retinyl esters can be mobilized from the liver by 
retinyl ester hydrolases (including Rpe65) that transform ret- 
inyl esters back to retinol. In the blood stream, this retinol is 
transported by retinol-binding protein 4(Rbp4) and transthyr- 
etin (Ttr) and the subsequent cellular uptake of retinol is me- 
diated by Stra6 (stimulated by retinoic acid protein 6) 
(Kawaguchi et al. 2007; Theodosiou et al. 201 0). 

Within the cell, retinol is oxidized to retinal and, in a sec- 
ond step, to retinoic acid (RA), the main physiologically 
active retinoid. In this process, retinol oxidation is the 
rate-limiting step and a key reaction for the physiological 
control of RA availability. Several dehydrogenases of the 
medium-chain dehydrogenase/reductase (MDR-Adh) super- 
family, including Adh1, Adh3, and Adh4, and of the short- 
chain dehydrogenase/reductase (SDR) superfamily, such as 
Rdh16 (also known as Rodh4) and Rdh10, catalyze this 
rate-limiting reaction in vitro (Gough et al. 1998; Belyaeva, 
Johnson, et al. 2008). Retinal produced by these enzymes 
is then transformed to RA through the action of retinaldehyde 
dehydrogenases (Aldhla and Aldh8a), which, in close coor- 
dination with RA-degrading enzymes of the cytochrome 
P450 subfamily 26 (Cyp26) class, regulate the spatiotemporal 
levels of RA (Niederreither et al. 2002; Reijntjes et al. 2005). 
Inside the cell, retinol, retinal and RA are generally bound to 
proteins that solubilize and stabilize retinoids in the aqueous 
environment of thecell. In vertebrates, several cellular retinol- 
binding proteins (Crbp) and cellular RA binding proteins 
(Crabp) have been identified (reviewed in Noy 2000), al- 
though it is still unclear whether the main physiological task 
of these binding proteins is to deliver retinoids to or to seques- 
ter them from the enzymes. 

Similar to retinoid metabolism, steroidogenesis (fig. 1 B) is 
also a complex biochemical pathway that involves numerous 
cytochrome P450 (Cyp) and hydroxysteroid dehydrogenases 
(Hsd) enzymes (Payne and Hales 2004). In vertebrates, ste- 
roidogenesis is initiated by Cyp11a enzymes that produce 
pregnenolone from cholesterol transferred from the outer 
to the inner membrane of mitochondria by Star (for steroido- 
genic acute regulatory protein, also known as Stardl) or by 
Stard3 (for Star-related lipid transfer domain containing 3, 
also known as Mln64) proteins (Clark et al. 1994; Watari 
et al. 1 997; Stocco 2001 ). Pregnenolone is transformed into 
progesterone by 3 p- hydroxysteroid dehydrogenase/A 5 -A 4 - 
isomerase (Hsd3p) or to dehydroepiandrosterone (DHEA) 
by Cyp1 7. Progesterone can be metabolized to aldosterone, 
Cortisol, or androstenedione through different pathways in- 
volving Cyp11b1, Cyp11b2, Cyp17, and Cyp21 enzymes, 



whereas DHEA can be transformed to androstanediol and 
to dihydrotestosterone (5aDHT), a potent androgen. 5aDHT 
can also be synthesized from androstenedione via testoster- 
one by the action of type 1 , type 2, and type 3 5ot-reductases 
(Srd5oc1, Srd5oc2, and Srd5oc3) (Russell and Wilson 1994; 
Uemura et al. 2008). Finally, androstenedione can be trans- 
formed to estrone by the aromatase enzyme Cyp1 9 and fur- 
ther to estradiol by Hsd enzymes of the SDR and aldo-keto 
reductases (Akr) superfamilies (reviewed in Payne and Hales 
2004). 

In vertebrates, pairs of Hsd enzymes participate in a prere- 
ceptor regulatory system of steroid action through the inter- 
conversion of weak to potent hormones (Penning 2003). This 
prereceptor regulation was first documented for mineralo- 
corticoid receptors and Hsd1 1 (3 enzymes (Funder et al. 
1 988; Oppermann et al. 1 997). Hsd 1 1 p type 1 enzymes con- 
vert inactive cortisone into the potent glucocorticoid Cortisol, 
whereas Hsd 1 1 p type 2 enzymes catalyze the opposite reac- 
tion, hence neutralizing the hormone. Other pairs of activat- 
ing/deactivating Hsd enzymes are: Hsd17pi (and Hsd17pi2 
and Akr1c3) versus Hsd17p2 (and Hsd 17 p8) enzymes for es- 
trone (weak) to estradiol (potent) conversion (and vice versa), 
Hsd17p3 (and Akr1c3)and Hsd17p2 (and Hsd17p8) for an- 
drostenedione (weak) to testosterone (potent) conversion 
(and vice versa), and Akr1 c2 (and Hsd1 7p6, Rdh5, and Dhrs9) 
and Akr1c3 (and Akr1c2 and Hsd17p8) for androstanediol 
(weak) to 5ocDHT (potent) conversion (and vice versa) (Biswas 
and Russell 1997; Fomitcheva et al. 1998; Penning et al. 
2000; Chetyrkin, Belyaeva, et al. 2001; Chetyrkin, Hu, 
et al. 2001; Huang and Luu-The 2001). 

Although the retinoid and steroid metabolic pathways 
have extensively been studied in vertebrates, the evolution- 
ary origins of these signaling systems are not fully under- 
stood. To investigate the evolution of these signaling 
systems in vertebrates, we have analyzed the genome of 
the invertebrate chordate amphioxus (Branchiostoma flori- 
dae). Phylogenetically positioned at the base of the chordate 
phylum, amphioxus is the extant invertebrate most closely 
resembling the last invertebrate ancestor of vertebrates, 
both in terms of morphology and genome (Schubert, Escriva, 
etal. 2006; Holland etal. 2008; Putnam etal. 2008). We have 
identified more than 220 amphioxus genes evolutionarily re- 
lated to vertebrate components of the retinoid and steroid 
metabolisms, and our phylogenetic analyses allowed us to 
reconstruct the retinoid and steroid machineries of the an- 
cestral chordate. Interestingly, we also found compelling 
evidence for massive expansions of some components 
of the retinoid and steroid pathways in amphioxus (most 
distinctively within the SDR superfamily). Based on phylo- 
genetic and genomic linkage data, we hypothesize that 
these expansions arose by extensive lineage-specific gene 
duplications, suggesting that the retinoid and steroid path- 
ways in amphioxus are much more complex than initially 
expected. Taken together, our work demonstrates that 
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functional diversification upon gene duplication in chordates 
is by no means limited to the vertebrate lineage. 

Materials and Methods 

Genome Analyses, Gene Identification, and 
Sequence Alignment 

We have carried out exhaustive searches of the amphioxus 
(B. floridae) genome (versions 1.0 and 2.0) (genome. 
jgi-psf.org/Brafl1/Brafl1 .home.html) for components of ret- 
inoid and steroid metabolism. Version 1 .0 of the B. floridae 
genome was preferentially used because it is more inclusive 
than version 2.0, from which some sequences have been 
removed (Putnam et al. 2008). Moreover, for the genomic 
linkage survey of the B. floridae sequences, we needed ge- 
nome position information, which is unavailable for version 
2.0 (Putnam et al. 2008). 

As recommended by Kuzniar et al. (2008), we searched 
the amphioxus genome for orthologous sequences using 
precomputed sets of homologous genes. In brief, for 
each component of retinoid and steroid signaling, we 
downloaded all related sequences from the protein 
families defined in the Ensembl database (version 52) 
(www.ensembl.org) (Flicek et al. 201 0), to which we added 
sequences from Uniprot (www.uniprot.org) and from the 
nonredundant protein database at the National Center 
for Biotechnology Information (www.ncbi.nlm.nih.gov). 
Whenever possible, we added other families to the analysis 
to properly outgroup our phylogenies. Sequences identified 
in the genome of the cnidarian Nematostella vectensis (ver- 
sion 1 .0) (genome.jgi-psf.org/Nemve1/Nemve1 .home.html) 
were also included in the analyses and were used wherever 
possible to root the phylogenetic trees. Protein alignments 
were calculated with ClustalW (version 2.0) (Larkin et al. 
2007) and Muscle (version 3.6) (Edgar 2004). The results 
of both alignment tools were merged and subsequently 
manually refined using SeaView (version 3.2) (Galtier 
et al. 1 996) to exclude redundant and improperly annotated 
sequences. In contrast to our initial survey of the amphioxus 
genome, where only human sequences were used for Blast 
searches (Holland et al. 2008), in the present analysis, all 
members of a given protein family were used for Blast 
searches of the complete predicted protein data sets of 
the amphioxus genome. This comprehensive methodology 
yielded an inclusive and reproducible set of amphioxus 
sequences potentially homologous to the different protein 
families of interest. 

All amphioxus sequences obtained by these database 
searches were retrieved, sorted, and allelic polymorphs were 
removed using nucleotide and amino acid alignments. In or- 
der to identify allelic polymorphs, nucleotide alignments 
were analyzed by eye and two sequences were considered 
as alleles, when, with the exception of a few polymorphic 
changes at third codon positions, large stretches of the 



compared sequences were identical. Intronic sequences 
were also aligned, and conservation of intronic nucleotidic 
sequences stretches were used as additional criterion for the 
identification of allelic polymorphs. Of the different allelic 
polymorphs, the sequence with the longest overall align- 
ment with the gene family in question was retained. The 
sequence identification numbers of all retrieved amphioxus 
sequences are given in the Supplementary Material online. 
The genomic position of each amphioxus gene was ob- 
tained from version 1 .0 of the B. floridae genome (Putnam 
et al. 2008) and used as a basis for genomic linkage anal- 
yses. Automatically annotated genes were manually revised 
taking into consideration the data available in both version 
1.0 and version 2.0 of the B. floridae genome, and errors 
were corrected to maximize the similarity with known pro- 
teins. This additional verification step involving both versions 
of the B. floridae genome yielded highly consistent results. 
The treated amphioxus data were subsequently incorpo- 
rated into the protein family alignments used for the initial 
Blast analysis of the amphioxus genome. Alignments were 
obtained using ClustalW (version 2.0) (Larkin et al. 2007) 
and Muscle (version 3.6) (Edgar 2004). The results of both 
alignment tools were merged and subsequently subjected 
to manual refinement. All alignment files are available from 
the authors. 

Phylogenetic Analyses 

All sequences retrieved from the amphioxus genome were 
subjected to phylogenetic analyses in the context of their 
proper protein families. For each family, phylogenies were 
calculated with sequences from humans, zebrafish, and 
amphioxus using N. vectensis sequences as outgroup. 
The SDR tree was calculated without N. vectensis repre- 
sentatives, and each family hence served as outgroup 
for the other families, which resulted in a reliable phyloge- 
netic reconstruction of this superfamily. The iLbp tree was 
also calculated without N. vectensis representatives be- 
cause we failed to identify iLbp-like sequences in the N. 
vectensis genome. Phylogenetic tree reconstructions were 
carried out using both the neighbor joining (NJ) and the 
maximum likelihood (ML) methods. For both NJ and ML 
analyses, robustness of the obtained tree topologies 
was assessed with 1,000 bootstrap replicates. NJ trees 
were calculated with Phylo_Win (version 2.0) using a Poisson 
correction and pairwise gap removal (Galtier et al. 1996), 
whereas ML trees were obtained with PhyML (version 
2.4.4) using a Jones, Taylor, and Thornton model, a discrete 
gamma model with four categories, a gamma shape param- 
eter of two and a ML estimate of the proportion of invariable 
sites (Guindon and Gascuel 2003). Phylogenetic trees were 
rooted with NJPIot (version 1.0) (pbil.univ-lyon1.fr/ 
software/njplot.html) (Perriere and Gouy 1996) and format- 
ted using FigTree (version 1.3.1) (tree.bio.ed.ac.uk). 
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Results 

We have carried out an exhaustive search of the amphioxus 
(B. floridae) genome for components of retinoid and steroid 
metabolism. To our great surprise, we found evidence for 
amphioxus-specific duplications in most of the families ana- 
lyzed. The best example for this tendency, with more than 
100 representatives in amphioxus (compared with only 37 
in humans), is the SDR superfamily. We will hence first 
describe our analysis of amphioxus homologs of vertebrate 
dehydrogenases/reductases with activity against retinol 
and/or hydroxysteroids, before presenting other elements 
implicated in retinoid and steroid metabolism. 

Identification of Retinol and Hydroxysteroid 
Dehydrogenases/Reductases in Amphioxus 

SDR-Rdh and SDR-Hsd Sequences 

The SDR superfamily includes 73 human members with 
a substrate spectrum ranging from polyols, retinoids, ste- 
roids, and fatty acid derivatives to xenobiotics. We have fo- 
cused our analysis on those SDR families that include 
enzymes active against retinoids and steroids (Belyaeva 
and Kedishvili 2006; Dalfo et al. 2007; Wu et al. 2007). Ac- 
cording to the recent reclassification of the SDR superfamily, 
based on sequence analyses, predicted secondary structure 
(Bray et al. 2009), and hidden Markov models (Kallberg et al. 
2010), we have therefore analyzed the SDR-Rdh and SDR- 
Hsd enzyme families 5C, 7C, 8C, 9C, 12C, 16C, 25C, 26C, 
28C, 30C, 32C, 34C, 37C, and 47C, with a total of 37 mem- 
bers in humans. A survey of the B. floridae genome identi- 
fied 107 genes encoding SDR enzymes belonging to these 
families and, hence, with putative functions in amphioxus 
retinoid or steroid metabolism. Of these 107 amphioxus 
SDRs, 6 partial sequences were excluded because they were 
too short for reliable phylogenetic tree reconstruction. Phy- 
logenetic analysis of the retained amphioxus, zebrafish, and 
human SDR representatives combined with genomic linkage 
analysis of the amphioxus genes revealed the evolutionary 
history of the chordate Rdh/Hsd enzymes. These results will 
be described following the order of the individual SDR fam- 
ilies within the overall SDR tree (fig. 2; supplementary fig. 
S1, Supplementary Material online). 

Family SDR-26C: Hsd1ipi-Hsd1ipil Enzymes 

Hsd1 1 p type 1 enzymes convert the inactive glucocorticoid 
cortisone into the potent Cortisol, and, in concert with 
Hsdlip type 2, regulate Cortisol levels (Funder et al. 
1988). Hsd11p1 enzymes are closely related to the 
Hsd11p1l proteins (also named Sdr10b or Hsd11p3 in 
chicken and fish), whose function remains unknown (Baker 
2004b). The amphioxus genome contains 23 family SDR- 
26C genes (fig. 2; supplementary fig. S1, Supplementary 
Material online; NJ, support value 97 of 100; ML, support 



value 1,000 of 1,000), most of them are found as tandem 
duplicates located on four scaffolds of the amphioxus ge- 
nome (table 1). Our phylogenetic analysis suggests that 
the amphioxus enzymes are orthologous to both vertebrate 
Hsd1 1 pi and Hsd1 1 pi I, and we hence named the sequen- 
ces Bf_Hsd1 1 pi/1l_1 to 23. The genomic organization to- 
gether with the tree topology suggest that the 
multiplicity of family SDR-26C members in amphioxus arose 
by lineage-specific duplication and further supports the no- 
tion that Hsdlipi and Hsdiipil diverged early in 
vertebrate evolution (Baker 2004b). 

Family SDR-34C: Dhrs7 Enzymes 

The human DHRS7 gene was identified from database 
searches of ESTs from retina and pineal gland (Haeseleer 
and Palczewski 2000) and, although no biochemical activity 
has been detected for DHRS7 so far, is expressed in the retina 
as well as in skeletal muscle and heart. Three amphioxus se- 
quences, Bf_Dhrs7_1 to3, grouped with the vertebrate Dhrs7 
enzymes (fig. 2; supplementary fig. S1, Supplementary 
Material online; NJ, 99; ML, 997). The tree topology indicates 
that the three amphioxus Dhrs7 genes duplicated specifically 
in the cephalochordate lineage. 

Family SDR-32C: Dhrs7b-Dhrs7c Enzymes 

Human DHRS7b and DHRS7c are predicted genes located 
on chromosome 17 that encode SDR proteins of unknown 
function. A single amphioxus sequence, called Bf_Dhrs7b/c, 
reliably clustered with the vertebrate Dhrs7b-Dhrs7c family 
(fig. 2; supplementary fig. S1, Supplementary Material 
online; NJ, 99; ML, 996) indicating the presence of this 
SDR family in the last common ancestor of amphioxus 
and vertebrates. 

Family SDR-16C: Rdh10-RdhE2-RdhE2l-Hsd17pi1- 
Hsd17pi3-Dhrs3 Enzymes 

Family SDR-16C includes vertebrate Rdh10, RdhE2 (for epi- 
dermal retinaldehyde dehydrogenase 2), RdhE2l (for RdhE2- 
like), Hsd17pi1, Hsd17p13, and Dhrs3 enzymes, which 
have diverse activities against retinoids and steroids. Verte- 
brate Rdh10, RdhE2, RdhE2l, and Dhrs3 have been involved 
in retinoid metabolism during differentiation and develop- 
ment, in pathological states and in the visual cycle. 
Hsd17pi 1 has been related to androgen metabolism during 
steroidogenesis, and the function of Hsd17pi3, a hepatic 
enzyme with a high degree of sequence identity with 
Hsd17pi 1, is still unknown. A total of eight amphioxus se- 
quences group with vertebrate family SDR-16C enzymes 
(fig. 2; supplementary fig. S1, Supplementary Material 
online; NJ, 100; ML, 1,000). Our phylogenetic analysis 
shows that one amphioxus sequence, Bf_Rdh10, is similar 
to Rdh10, three sequences, Bf_RdhE2_1-3, all of which lo- 
cated on the same scaffold (table 1), are related to both 
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Fig. 2. — Schematic phylogeny of members of the SDR superfamily with activity against retinoids and/or steroids. The nomenclature of the three major SDR 
clusters (C 1 , C2, and C3) as well as of the individual SDR families is based on the recent reclassification of the SDR superfamily (Bray et al. 2009; Kallberg et al. 201 0). 
Bootstrap values (NJ/ML) supporting the three major SDR clusters (C 1 , C2, and C3) and each of the 1 4 SDR families are indicated. Retinoid (blue) and steroid (orange) 
activities of the vertebrate enzymes are indicated, illustrating that both enzymatic activities are commonly found within the SDR superfamily, sometimes even within 
a single SDR family. Vertebrate branches are in green, whereas amphioxus branches are in red. The number of amphioxus sequences in each branch is indicated. 
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RdhE2and RdhE2l, two sequences, Bf_Hsd17pi 1/1 3_1 and 
2, are associated with the vertebrate Hsd17pi 1-Hsd17pi3 
clade, and two sequences, Bf_Dhrs3_1 and 2, cluster with 
the vertebrate Dhrs3 enzymes, hence establishing an almost 
complete repertoire of amphioxus family SDR-16C mem- 
bers. The phylogenetic analysis thus indicates that the diver- 
sification of family SDR-16C took place before the 
cephalochordate-vertebrate split. 

Family SDR-12C: Hsd17p3-Hsd17pi2-HsdL1 
Enzymes 

This family includes several enzymes involved in the produc- 
tion of sexual hormones and in the metabolism of fatty acids 
(Geissler et al. 1 994; Huang et al. 2001 ; Moon and Horton 
2003; Luu-The et al. 2006). Vertebrate Hsd17p3 catalyzes 
the conversion of androstenedione to testosterone, 
Hsd1 7pi 2 transforms estrone into estradiol and participates 
in the microsomal elongation of fatty acids, and HsdU is 
highly expressed in steroidogenic tissues, and its function 
has been related to the progression of prostate cancer. Four 
amphioxus enzymes belong to family SDR-1 2C (fig. 2; sup- 
plementary fig. S1 , Supplementary Material online; NJ, 1 00; 
ML, 1,000). Three of them, Bf_Hsd17p12_1-3, group with 
Hsd17p12 and one, BfJHsdU, clusters with HsdU. Of 
these, Bf_Hsd17p12_2 and Bf_Hsd17p12_3 are located 
on the same scaffold (table 1). No sequence orthologous 
to vertebrate Hsd1 7p3 could be identified in the amphioxus 
genome. 

Family SDR-9C: Rdh cluster-Bdh1-Hsd1 1 p2- 
Hsd17p2 Enzymes 

Family SDR-9C includes three groups of vertebrate en- 
zymes: the Rdh cluster, the Hsd-type 2 group, and the 
3-hydroxybutyrate dehydrogenase type 1 (Bdh1) group. 
Altogether, 27 amphioxus sequences group within this 
family (fig. 2; supplementary fig. S1, Supplementary 
Material online; NJ, 98; ML, 999). The vertebrate Rdh clus- 
ter includes six human genes, RDH5, RDH16, similar to 
RDH 16, HSD17/J6, SDR9C7, and DHRS9. Human RDH5 
and RDH16 are involved in visual and general retinol 
oxidation, respectively (Gough et al. 1998; Yamamoto 
et al. 1 999), although both of them also show some activ- 
ity toward steroids. In addition, HSD17P6 and DHRS9 are 
also active against both retinoids and steroids (Biswas and 
Russell 1997; Chetyrkin, Belyaeva, etal. 2001; Chetyrkin, 
Hu, etal. 2001; Soref etal. 2001), whereas SDR9C7 does 
not have a significant activity against steroids and shows 
only very weak retinaldehyde reductase activity (Kowalik 
et al. 2009). A total of 12 amphioxus sequences, named 
Bf_Rdh_1 to 12, group with the vertebrate Rdh cluster 
(fig. 2; supplementary fig. S1, Supplementary Material on- 
line), including two previously characterized retinaldehyde 
reductases of amphioxus (Bf_Rdh_1 and Bf_Rdh_2) (Dalfo 



et al. 2001 , 2007). The genomic linkage data indicate that 
at least 7 of the 1 2 amphioxus Rdh genes cluster together 
on three distinct scaffolds (table 1). In our phylogenetic 
tree, vertebrate and amphioxus Rdh sequences group sep- 
arately, indicating that the extensive expansion of this fam- 
ily in amphioxus and the diversification of the Rdh cluster in 
vertebrates were independent evolutionary events. 

Hsd-type 2 enzymes function as NAD + -dependent oxi- 
dases to convert active steroid hormones to their inactive 
forms. Hsd1ip2 oxidizes glucocorticoids, transforming po- 
tent Cortisol to weak cortisone, whereas Hsd17p2 oxidizes 
potent sex hormones, such as estradiol and testosterone, 
into weak estrone and androstenedione, respectively (Wu 
et al. 1 993; Albiston et al. 1 994). One amphioxus sequence 
branches outside a clade that comprises both vertebrate 
type 2 enzymes (fig. 2; supplementary fig. S1, Supplemen- 
tary Material online). The amphioxus enzyme orthologous to 
both vertebrate forms was therefore named Bf_Hsd1 1/ 
1 7(32. The tree topology indicates that the 1ip- and 17p- 
forms diverged from an ancestral Hsd-type 2 enzyme 
duplicated in the lineage leading to extant vertebrates. This 
divergence in vertebrates of the 1 1 p- and 1 7p-forms has been 
supported by the phylogenetic analysis of Hsd-type 2 sequen- 
ces from sea urchins and acorn worms, which also cluster at 
the base of the vertebrate Hsd-type 2 clade (Baker 2010a). 

We also identified 1 4 amphioxus sequences that clustered 
with vertebrate Bdh1 (fig. 2; supplementary fig. S1 , Supple- 
mentary Material online), a mitochondrial lipid-requiring en- 
zyme that serves to interconvert ketone bodies (Marks et al. 
1992). Two of the 14 amphioxus sequences, named 
Bf_Bdh1_1 and 2, stably group with the vertebrate enzymes, 
whereas the other 12 sequences, Bf_Bdh1_3 to 14, fall out- 
side thiscluster. Interestingly, Bf_Bdh1_1 and 2, togetherwith 
Bf_Bdh1_3, are linked in the amphioxus genome, with the 
other 1 1 amphioxus Bdh1 genes clustering on two different 
scaffolds (table 1). Together, the tree topology and linkage 
data suggest an initial duplication of a Bdh 1 ancestor predat- 
ing the cephalochordate-vertebrate split. Although only one 
of these ancestral Bdh1 duplicates was preserved in verte- 
brates, both of them were retained and independently 
expanded in the amphioxus lineage. 

Family SDR-28C: Hsd17pi-Rdh8 Enzymes 

Vertebrate Hsd17pi is predominantly expressed in ovaries, 
breast tissues and placenta, and has been considered as 
a major determinant of peripheral and gonadal estradiol 
synthesis (Peltoketo et al. 1988, 1999). Rdh8, also known 
as photoreceptor retinol dehydrogenase (prRDH), localizes 
specifically to the outer segments of photoreceptor cells re- 
ducing retinal to retinol in the visual cycle (Rattner et al. 
2000; Maeda et al. 2005, 2007). Our survey of the amphi- 
oxus genome did not reveal any amphioxus sequences clus- 
tering with this family. Nevertheless, the identification of 
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Table 1 

Genomic Linkage in Amphioxus of Retinoid and Steroid Metabolism Genes 
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several putative Rdh8/Hsd17pi sequences in cnidarians (Tar- 
rant et al. 2009) and sea urchins (Mindnich and Adamski 
2009) suggests that the members of this family might have 
been lost specifically in the amphioxus lineage. 

Family SDR-7C: Rdh11-Rdh12-Rdh13-Rdh14 Enzymes 

Vertebrate Rdh11, Rdh12, Rdh13, and Rdh14are retinal- 
dehyde reductases predominantly involved in retinoid me- 
tabolism of the eye (Haeseleer et al. 2002; Kim et al. 2005; 
Lee et al. 2007; Maeda et al. 2007). In addition, human 
RDH 1 1 is also involved in the regulation of retinoid homeo- 
stasis (Lin et al. 2001 ; Kedishvili et al. 2002; Belyaeva et al. 
2005), RDH12 in retinal reduction of extraocular tissues 
(Belyaeva et al. 2005), RDH 13 in protection against retinal 
produced from dietary p-carotenes (Belyaeva, Korkina, 
et al. 2008), and RDH14 in retinal reduction in most human 
tissues (Belyaeva and Kedishvili 2002). Phylogenetically, 
Rdh1 1 and Rdh12 are closely related enzymes that arose 
from an Rdh1 1/1 2 precursor likely during mammalian evo- 
lution, whereas Rdh13 and Rdh14 have diverged from the 
Rdh1 1/12 ancestor early during evolution. In amphioxus, 
we found 24 sequences that belonged to the SDR family 
SDR-7C (fig. 2; supplementary fig. S1, Supplementary 
Material online; NJ, 100; ML, 1,000). One amphioxus se- 
quence, Bf_Rdh13, is closely related to vertebrate Rdh13 
and one, Bf_Rdh14, branches with vertebrate Rdh14, 
whereas the 22 remaining sequences, named Bf_Rdh1 1/ 
1 2_1 to 22, are not clearly associated with any of the verte- 
brate SDR-7C family members. A total of 20 of the 22 
Bf_Rdh1 1/12 sequences (Bf_Rdh1 1/1 2_3 to 22) cluster tog- 
ether in the phylogenetic tree, indicating a lineage-specific 
amplification of these amphioxus Rdh1 1/1 2 sequences. 
Genomic linkage of 16 of the 22 amphioxus Rdh1 1/12 genes 
on five distinct scaffolds (table 1) strongly supports this 
notion. The internal tree topology does not unambiguously 
resolve the phylogenetic relationship of the amphioxus and 
vertebrate enzymes. The phylogenetic analysis nonetheless 
indicates that the chordate ancestor already had multiple 
genes encoding SDR-7C family enzymes, including Rdh1 1/ 
12, Rdh13, and Rdh14, and that some of these ancestral 
sequences experienced a significant expansion in the amphi- 
oxus lineage. 

Family SDR-37C: Hsd17p7 Enzymes 

Hsd1 7p7 enzymes show the typical SDR signature at the co- 
factor-binding motif and at the active site, but their overall 
sequence identity with other SDR families is low. Hsd17p7, 
initially described as prolactin receptor-associated protein 
(PRAP) (Duan et al. 1 996), are cholesterogenic enzymes that 
participate in cholesterol biosynthesis (Mindnich et al. 2004; 
Shehu et al. 2008). Hsd17p7 has been lost in cholesterol 
auxotrophic animals that also lost squalene epoxidase (Sqle), 
a crucial enzyme for cholesterol biosynthesis (Marijanovic 



et al. 2003). There is a single Hsd17p7 ortholog in the 
amphioxus genome (fig. 2; supplementary fig. S1, Supple- 
mentary Material online; NJ, 100; ML, 1,000). The identifi- 
cation of Bf_Hsd17p7, together with the presence of 
a putative Sqle enzyme in amphioxus (GenBank accession 
number XP_002594656), is of evolutionary relevance be- 
cause these findings suggest that cephalochordates are ca- 
pable of the de novo synthesis of cholesterol and hence 
imply that this metabolic pathway was already present in 
the last common ancestor of all chordates. 

Family SDR-30C: Hsd17p8 Enzymes 

Hsd17p8 is an oxidative enzyme inactivating estradiol, tes- 
tosterone, and dihydrotestosterone (Fomitcheva et al. 
1 998). Whereas in humans and zebrafish, there is only a sin- 
gle Hsd17p8, we found six Hsd17p8 genes encoded in the 
amphioxus genome, Bf_ Hsd1 7p8_1 to 6 (fig. 2; supplemen- 
tary fig. S1, Supplementary Material online; NJ, 100; ML, 
1 ,000). Two of these amphioxus sequences, Bf_ Hsd1 7p8_3 
and 4, are linked in the genome (table 1). Altogether, these 
data suggest that members of the Hsd17p8 family were al- 
ready present in the genome of the chordate ancestor and 
that the amphioxus Hsd17p8 gene complement was 
expanded by lineage-specific duplication. 

Family SDR-25C: Dhrs4 Enzymes 

The vertebrate members of this SDR family have been as- 
signed very different enzymatic activities: whereas human 
DHRS4 has been shown to contribute to the reduction of 
3-ketosteroids into 3p-hydroxysteroids (Matsunaga et al. 
2008), the mouse Dhrs4 enzyme has been described as a ret- 
inaldehyde reductase (Lei et al. 2003). Our analysis identified 
one amphioxus representative of this SDR family, Bf_Dhrs4 
(fig. 2; supplementary fig. S1 , Supplementary Material online; 
NJ, 100; ML, 1 ,000). Given the presence of an amphioxus rep- 
resentative in this SDR family, the evolutionary origin of Dhrs4 
enzymes evidently predated chordate divergence. 

Family SDR-5C: Hsd17pi0 Enzymes 

Vertebrate Hsd17pi0 is a multifunctional enzyme that, in 
addition to a 17p-Hsd activity with sex steroids, acts as 
3a-, 7a-, 7p-, 17p-, 20p-, and 21 -oxidase of several different 
substrates (Shafqat et al. 2003). A single amphioxus se- 
quence, named Bf_Hsd17piO, branched at the base of 
the vertebrate representatives of this SDR family (fig. 2; 
supplementary fig. S1, Supplementary Material online; NJ, 
100; ML, 1,000) suggesting that Hsd17p10 enzymes were 
already present in the last common ancestor of amphioxus 
and vertebrates. 

Family SDR-47C: Hsd17pi4 Enzymes 

Hsd17pi 4 was initially suggested to function in retinol me- 
tabolism (Haeseleer and Palczewski 2000), but recent work 
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suggests that the human enzyme acts as a 1 7p-Hsd with sex 
steroids in brain, liver, and placenta (Lukacik et al. 2007). In 
amphioxus, there is a single member of this SDR family, 
Bf_Hsd17pi4 (fig. 2; supplementary fig. S1, Supplementary 
Material online; NJ, 100; ML, 1,000). Hsd17pi4 enzymes 
were thus very likely already present in the last common an- 
cestor of all chordates suggesting that the evolutionary or- 
igin of this SDR family predates the diversification of the 
chordate lineage. 

Family SDR-8C: Hsd17p4 Enzymes 

The vertebrate Hsd17p4 enzyme (also called MFE-2, MFP-2, 
and DBP) is involved in peroxisomal fatty acid p-oxidation 
and its deficiency causes severe abnormalities in several or- 
gans, particularly in the brain (reviewed in Huyghe et al. 
2006). In the amphioxus genome, we identified one se- 
quence closely related to vertebrate Hsd17p4. This 
sequence, Bf_Hsd17p4, marks the base of the vertebrate 
Hsd17p4 enzymes (fig. 2; supplementary fig. S1, Supple- 
mentary Material online; NJ, 100; ML, 1,000) hence indicat- 
ing that this SDR family might have already been present at 
the dawn of chordate diversification. 

Adh and Akr Sequences in Amphioxus 

In vertebrates, retinol dehydrogenase activity has also been 
associated with some members of the Adh family, mainly with 
the Adh1, Adh3, and Adh4 enzymes. Our survey of the 
B. floridae genome supports previous work on Adh evolution 
showing that amphioxus had a single Adh enzyme (Bf_Adh3) 
that groups with the members of the vertebrate Adh family 
(fig. 3A, supplementary fig. S2, Supplementary Material 
online) (Cahestro et al. 2002; reviewed in Gonzalez-Duarte 
and Albalat 2005). Biochemically, amphioxus Adh3 is similar 
to vertebrate Adh3 (Cahestro et al. 2000; Godoy et al. 2006) 
suggesting that the enzyme activities of Adh 1 and Adh4 were 
vertebrate innovations (Cahestro et al. 2003, 2010). 

Several members of the Akr1 group in the Akr superfam- 
ily show activity against retinoids and steroids. Akrlb en- 
zymes that convert glucose to sorbitol can also reduce 
retinal to retinol (Ruiz et al. 2009), and some Akr1 c enzymes 
can act as Hsd enzymes controlling the interconversion of 
weak steroid hormones to potent hormones (Dufort et al. 
1 999). There are 9 Akr1 sequences in humans classified into 
5 subgroups (Akr1 A to E). The survey of the amphioxus ge- 
nome revealed 19Akr1 sequences in amphioxus (Bf_Akr1_1 
to 1 9) and the phylogenetic analysis grouped 1 8 of these 1 9 
amphioxus sequences into four well-supported clades (fig. 
3B, supplementary fig. S3, Supplementary Material online). 
Genomically linked amphioxus Akr1 genes are found only 
within a given clade: linkage groups Akr1_2, 4, and 6 
and Akr1_3 and 5 branch within the same clade; Akr1_7, 
8, 9, 10, and 12 are also grouped in the tree; Akr1_14 
and 16 as well as Akr1_15 and 17 together establish one 
of the four well-supported clades (table 1). 



The tree does not efficiently resolve the phylogenetic re- 
lationship between the 19 amphioxus Akr1 sequences and 
their vertebrate orthologs, but given that amphioxus Akrls 
within the same clade have the tendency to be linked on the 
genome, the expansion of Akr1 enzymes in amphioxus was 
likely independent of the diversification of the vertebrate 
Akr1 proteins. To further investigate the lineage-specific 
Akr1 duplicates in amphioxus, we carried out structural 
analyses of their substrate binding and processing domains. 
In vertebrate Akr1 enzymes, in addition to the invariant res- 
idues Y50 and H1 13 within the active site, the amino acids 
at positions 47, 49, 82, 1 1 4, 1 1 6, 1 24, 1 25, 1 33, 1 35, 299, 
and 301 (numbering relative to human AKR1 A1 ) have been 
associated with substrate binding in the binding pocket (Jez 
et al. 1 997; Penning 1 997). In amphioxus Akr1 s, the conser- 
vation at these positions is highest at residues involved in 
general substrate recognition and binding (e.g., 100% con- 
servation at positions 47, 50, and 113), is intermediate at 
residues determining sugar versus steroid specificity (e.g., 
63.2% conservation at position 49 and 52.6% conservation 
at position 1 1 4), and is lowest at residues contributing to the 
definition of steroid specificity (e.g., 26.3% of conservation 
at position 299 and 1 5.8% of conservation at position 301 ). 
Altogether, this pattern is compatible with the hypothesis 
that at least some of the amphioxus Akrls exhibit the 
capacity to metabolize steroids. 

Amphioxus Retinoid Metabolism 

Bcmol, Bco2, and Rpe65 Family 

In vertebrates, the Bcmol enzyme (also known as Bco) cleaves 
P-carotene into two molecules of retinal (reviewed in Ross 
et al. 2000 and Gottesman et al. 2001). Bcmol is closely re- 
lated to Bco2 (for p,p-carotene-9',10'-oxygenase) and Rpe65 
(for retinal pigment epithelium protein of 65 kDa) enzymes. 
Bco2 catalyzes the asymmetric cleavage of p-carotenes (Kiefer 
et al. 2001), which has been proposed as an alternative 
pathway of RA synthesis (Simoes-Costa et al. 2008), whereas 
Rpe65 converts retinyl esters to retinol for photopigment 
regeneration in the visual cycle (Jin et al. 2005). From an 
evolutionary perspective, identification of Bcmo1/Bco2/Rpe65 
enzymes in urochordates (Takimoto et al. 2006) and Drosoph- 
ila (von Lintig and Vogt 2000) indicates that the cleavage of 
P-carotene is an ancient biochemical activity in metazoans. 

In total, five amphioxus sequences showed similarity with 
vertebrate Bcmol /Bco2/Rpe65 enzymes and grouped with 
the vertebrate enzymes in our phylogenetic analysis (fig. 3C, 
supplementary fig. S4, Supplementary Material online; NJ, 
94; ML, 993). The amphioxus sequences did not group with 
any of the three vertebrate enzymes and are not linked in 
the amphioxus genome (table 1), which is compatible with 
the notion that Bcmol, Bco2, and Rpe65 enzymes under- 
went independent duplication and functional specialization 
in the cephalochordate and vertebrate lineages. 
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Fig. 3.— Diagrammatic tree topologies of the (A) Adh, (B) Akrl, (0 Bco, (D) Aldh, (£) iLbp, (F) NlpC/P60, (G) Dgat, (H) Lipocalin, (/) Ttr/Urah, (J) 
Stra6, (K) Star, (L) Cyp, (M) Hsd3(3, and (A/) Srd5ot families. Simplified phylogenies of the main components of retinoid and steroid metabolism are 
shown. Vertebrate branches are in green, amphioxus branches are in red, and branches containing sequences from the cnidarian Nematostella vectensis 
are in yellow. 



Aldhla, Aldh8a, and Cyp26 Enzymes 

In vertebrates, retinal is mainly oxidized to RA by three 
Aldhla enzymes (Aldh1a1, Aldh1a2, and Aldh1a3) and en- 
dogenous RA is degraded to biologically inactive forms by 



Cyp26 enzymes (Cyp26a, Cyp26b, and Cyp26c). Vertebrate 
Aldh8a1 (also known as Raldh4 and Aldh12) is also capable 
of oxidizing retinal (Lin and Napoli 2000; Lin et al. 2003; 
Liang et al. 2008), although the contribution of Aldh8a1 
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to RA signaling is not yet fully understood. Previous work has 
identified six Aldhla and three Cyp26 in amphioxus, and 
phylogenetic analyses have suggested that this diversity 
of Aldhla and Cyp26 originated by independent duplica- 
tions in the amphioxus and vertebrate lineages (Cahestro 
et al. 2006; Marletaz et al. 2006). Genomic linkage data 
provide additional evidence for this hypothesis: five of the 
six amphioxus Aldhla genes are located on two genome 
scaffolds and all three amphioxus Cyp26 genes are clustered 
on a single scaffold (table 1). 

We have extended the previous analyses by searching for 
Aldh8a1 orthologs in amphioxus and identified one amphi- 
oxus sequence, Bf_Aldh8a. Phylogenetic analysis showed 
that Bf_Aldh8a is orthologous to vertebrate Aldh8a1 (fig. 
3D, supplementary fig. S5, Supplementary Material online; 
NJ, 100; ML, 1,000). The identification of Bf_Aldh8a com- 
pletes the evolutionary analysis in amphioxus of the basic 
genetic machinery controlling the spatiotemporal levels of 
endogenous RA (i.e., Aldhla, Aldh8a, and Cyp26 enzymes) 
and further strengthens the notion that this machinery was 
already present in the last common ancestor of all chordates 
(Cahestro et al. 2006; Marletaz et al. 2006; Campo-Paysaa 
et al. 2008; Albalat and Cahestro 2009; Theodosiou et al. 
2010). 

Cellular Retinoid Binding Proteins: Crbp and Crabp 

Intracellular retinoid-binding proteins, such as Crbp and 
Crabp, bind retinol, retinal and RA to solubilize and stabilize 
the retinoids in the aqueous environment of the cell. Verte- 
brate Crbp and Crabp belong to the family of intracellular 
lipid binding proteins (iLbp), together with three groups of 
fatty acid-binding proteins (Fabps), namely Fabp1/6, Fabp2, 
and Fabp3/4/5/7/8/9 (Schaap et al. 2002). In humans, there 
are 4 Crbp, 2 Crabp, 1 Fabp2, 2 Fabp1/6, and 6 Fabp3/4/5/7/ 
8/9, each one of these proteins being characterized by dif- 
ferent ligand binding properties. In amphioxus, we identi- 
fied 1 1 iLbp. Although one of these amphioxus iLbp, 
Bf_Crbp, strongly groups with vertebrate Crbp (fig. 3E, sup- 
plementary fig. S6, Supplementary Material online; NJ, 97; 
ML, 998) and one, Bf_Fabp1/6, groups with vertebrate 
Fab1/6 (fig. 3E, supplementary fig. S6, Supplementary Ma- 
terial online; NJ, 74; ML, 958), the other nine amphioxus 
sequences, named Bf_iLbp_1 to 9, are not clearly associated 
with any vertebrate iLbp group. Because seven of the nine 
amphioxus iLbp genes are clustered on two genome scaf- 
folds (table 1), it is very likely that these amphioxus genes 
originated by lineage-specific duplication. 

Unlike for Crbp, we were hence unable to identify a clear 
amphioxus ortholog of vertebrate Crabp. Several iLbp family 
members with retinoid-binding capacity have been de- 
scribed in protostomes, such as insects and crustaceans 
(Mansfield et al. 1998; Gu et al. 2002; Folli et al. 2005; 
Soderhall et al. 2006), but sequence analyses and 3D struc- 
tural modeling showed that these protostome iLbps are not 



orthologous to vertebrate Crbp and Crabp (Gu et al. 2002; 
Folli et al. 2005; Soderhall et al. 2006). It is therefore very 
likely that the capacity to bind retinoids has evolved inde- 
pendently in different iLbp families in the course of bilaterian 
diversification. Biochemical analyses will hence be required 
to determine the exact physiological function of each am- 
phioxus member of the iLbp family. 

Lrat and Dgatl Enzymes 

The Lrat enzyme catalyzes retinol esterification into retinyl 
esters, which are accumulated in the vertebrate liver where 
they can be mobilized to maintain retinoid homeostasis. 
Lrats hence participate in retinoid storage mechanisms. Lrat 
enzymes belong to the Lrat-like family of the complex NIpC/ 
P60 superfamily, which unites Lrat with the Hrasl (for Hras- 
like suppressor, also known as H-rev1 07-like protein) and 
Fam84 (for family with sequence similarity 84) subfamilies 
(Anantharaman and Aravind 2003). We found 13 amphi- 
oxus sequences with similarities to vertebrate Lrat-like pro- 
teins. Only one of these sequences grouped with one of the 
three vertebrate subfamilies: Bf_Hrasls, which is weakly as- 
sociated with the vertebrate Hrasls (fig. 3F, supplementary 
fig. S7, Supplementary Material online; NJ, 38; ML, 773). 
The other 12 amphioxus sequences are located on only 
three scaffolds of the genome (table 1 ) and cluster together 
in the phylogenetic tree (fig. 3F, supplementary fig. S7, Sup- 
plementary Material online), indicating that they are the re- 
sult of an extensive lineage-specific duplication. These 12 
amphioxus sequences were called Bf_ Fam84/Hrasls/Lrat_1 
to 12. 

Because phylogenetic analyses did not properly resolve 
the relationships between amphioxus and vertebrate en- 
zymes, we analyzed the amino acid signatures of the differ- 
ent Lrat-like subfamilies. Lrat-like proteins share two 
catalytic residues, H60 and C161 (numbers relative to the 
human LRAT enzyme) (Mondal et al. 2000; Jahng et al. 
2003). C161 is the central cysteine of the highly conserved 
NCE-box of Lrat and Hrasls proteins, whereas in Fam84 
members, this cysteine has been replaced by a serine (result- 
ing in a NSE-box). None of the 1 3 amphioxus sequences ex- 
hibits a NSE-box, suggesting that this amino acid 
replacement in the Fam84 subfamily took place only after 
the split of the amphioxus and vertebrate lineages. For 
H60, the conserved amino acid context is HYGIY in Lrat en- 
zymes, HW A / G 7|/ V Y in Hrasls and HWA v /, F / Y /c in Fam84. For 
this signature, the 1 3 amphioxus sequences all show a Hrasls 
HW A / G L /|/ V Y context, suggesting that the activity of the am- 
phioxus Lrat-like proteins might be most similar to that of 
vertebrate Hrasls. In this context, it is interesting to note that 
the Lrat-like protein EGL-26 of the nematode worm Caeno- 
rhabditis elegans has been functionally implicated in lipid 
metabolism (Estes et al. 2007), a role more similar to that 
of vertebrate Hrasls (Jin et al. 2007; Nazarenko et al. 
2007) than to that of Lrats. Taken together, this suggests 
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that a Hrasls-like activity might be ancestral in this family and 
that Lrat- and Fam84-like activities arose specifically in the 
vertebrate lineage after duplication of an ancestral Hrasls- 
like enzyme. 

In addition to Lrat proteins, enzymes involved in the synthe- 
sis of triacylglycerol, such as acyl coenzyme A:diacylglycerol 
acyltransferase (Dgatl ), are capable of contributing to retinol 
esterification by carrying outtheacyl coenzyme A:retinolacyl- 
transferase reaction (Ross 1 982; O'Byrne et al. 2005; Orland 
et al. 2005). Although the function of vertebrate Dgatl in the 
hepatic control of retinoid homeostasis has been questioned 
(Batten et al. 2004), in the absence of a clear amphioxus Lrat 
ortholog, the contribution of Dgatl enzymes in amphioxus to 
the esterification of retinol for retinyl ester storage deserves 
further investigation. Dgatl enzymes are related to sterol 
O-acyltransferases (Soat) that catalyze the formation of cho- 
lesterol esters from cholesterol. In humans, there are one 
Dgatl and two Soat enzymes. We found two sequences sim- 
ilar to vertebrate Dgatl and Soat enzymes in the amphioxus 
genome and phylogenetic analysis showed that one of theses 
sequences, named Bf_Dgat1 , groups with vertebrate Dgatl , 
whereas the other one, called Bf_Soat1/2, branches with ver- 
tebrate Soatl and Soat2 enzymes(fig. 3G, supplementaryfig. 
S8, Supplementary Material online; NJ, 1 00; ML, 980). Taken 
together, these data suggest that, in the absence of a clear 
amphioxus Lrat ortholog, retinol esterification in amphioxus 
might be catalyzed by Dgat, an enzyme that might have hence 
played a key role in the biosynthesis of retinyl esters in the 
ancestral chordate. 

Rbp4, Ttr, and Stra6 Proteins 

In vertebrates, hepatic retinyl esters can be hydrolyzed 
back to retinol, which subsequently enters the blood 
stream in association with Rbp4 and Ttr proteins. Rbp4 be- 
longs to the lipocalin family, a large group of extracellular 
proteins that bind and transport small hydrophobic mole- 
cules. Although the evolutionary origins of Rbp4 remain 
unclear, another member of this family, apolipoprotein D 
(ApoD), is considered the most ancient metazoan lipocalin. 
Vertebrate Rbp4 is evolutionary related to the prostaglan- 
din D2 synthases (Ptgds)-like group that includes Ptgds, al- 
pha-1-microglobulin/bikunin preproprotein (Ambp), and 
apolipoprotein M (ApoM) (Flower 1996; Ganfornina 
et al. 2000; Gutierrez et al. 2000). 

In amphioxus, we found six lipocalin-like sequences. 
A total of five of those, Bf_ApoD_1 to 5, are similar to 
the vertebrate ApoD proteins, whereas one sequence is 
more closely related to the vertebrate Ptgds-like group 
(fig. 3H, supplementaryfig. S9, Supplementary Material on- 
line; NJ, 72; ML, 761 ). The amphioxus sequence was named 
Bf_Ambp/ApoM/Ptgds/Rbp4 to reflect this phylogenetic re- 
lationship. Since the exon-intron organization of lipocalin 
genes contains information about the evolutionary origins 
of this gene family (Sanchez et al. 2003), we decided to an- 



alyze the genomic locus of Bf_Ambp/ApoM/Ptgds/Rbp4. 
The amphioxus Ambp/ApoM/Ptgds/Rbp4 gene has six cod- 
ing exons and five introns with a 0,2, 1 , 1 , 1 pattern of intron 
phases, which is identical to vertebrate ApoM and Ptgds 
genes but different from vertebrate Rbp4 genes (Sanchez 
et al. 2003). It therefore seems that the lipocalin gene of 
the ancestral chordate was structurally more similar to ex- 
tant ApoM and Ptgds genes than to Rbp4 genes. 

Rbp4 interacts with transthyretin (Ttr) to transport and 
distribute retinol to peripheral tissues. Vertebrate Ttr pro- 
teins are evolutionary related to 5-hydroxyisourate hydro- 
lases (Urah) enzymes (Zanotti et al. 2006). An amphioxus 
sequence similar to vertebrate Ttr and Urah was identified 
in our genomic survey. The phylogenetic analysis was unable 
to resolve, whether the amphioxus sequence is orthologous 
to either vertebrate Ttr or Urah or to both, and we hence 
named the sequence Bf_Ttr/Urah (fig. 31, supplementary 
fig. S10, Supplementary Material online). Vertebrate Ttr 
and Urah, however, exhibit different amino acids at key po- 
sitions for substrate accommodation and catalysis: H11, 
D49, R51, H102, Y115, and S1 18 in Urah (numbers refer 
to the mouse Urah sequence because no human Urah 
has so far been identified) versus K35, S72, E74, T126, 
T1 39, and V1 42 in Ttr (numbers refer to the human TTR se- 
quence). In the amphioxus Ttr/Urah, the amino acids at 
these key positions correspond more closely to those of ver- 
tebrate Urah, which suggests a 5-hydroxyisourate hydrolase 
activity of the amphioxus protein. These data therefore sup- 
port the notion that the hormone transport protein Ttr arose 
during vertebrate evolution by duplication from an ancestral 
Urah-like enzyme (Zanotti et al. 2006). 

After transport to the target tissue, retinol uptake into the 
cell is mediated by Stra6, a cell-surface receptor for Rbp4 (Ka- 
waguchi et al. 2007; Theodosiou et al. 2010). Stra6 proteins 
show similarity with a group of vertebrate proteins of un- 
known function, called Stra6l (for Stra6-like). We identified 
four amphioxus sequences with similarity to vertebrate Stra6 
in the amphioxus genome, but phylogenetic reconstruction 
groups the amphioxus sequences with the vertebrate Stra6l 
proteins (fig. 37, supplementary fig. S1 1 , Supplementary Ma- 
terial online; NJ, 74; ML, 982). We concluded, therefore, that 
there is no Stra6 ortholog in amphioxus, suggesting that 
Stra6 was either lost in cephalochordates or originated spe- 
cifically in the vertebrate lineage. 

In summary, our searches for Rbp4, Ttr, and Stra6 ortho- 
logs in the amphioxus genome suggest that amphioxus 
probably lacks a vertebrate-like system of retinoid transport 
and uptake. Functional studies are needed to determine 
whether some of the sequences we have identified in the 
genome might participate in an alternative system in amphi- 
oxus for the mobilization and delivery of retinoids from stor- 
age reservoirs and hence whether some kind of retinoid 
storage, mobilization, and delivery system was active in 
the ancestral chordate. 
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Steroidogenesis Enzymes in Amphioxus 
Star and Cyp11a 

The first reaction in the steroidogenesis pathway is the syn- 
thesis of pregnenolone from cholesterol in the inner mem- 
brane of mitochondria. In vertebrates, cholesterol is 
delivered to mitochondria by members of the Star family 
(Clark et al. 1994; Watari et al. 1997; Stocco 2001), all of 
which contain a START domain required for intracellular 
lipid transport, lipid metabolism, and cell signaling pro- 
cesses. Members of the Star family include multiple Stard 
(for START domain) proteins, Did (for deleted in liver can- 
cer 1), Col4a3bp (for collagen, type IV, alpha 3 binding 
protein), and Pctp (for phosphatidylcholine transfer pro- 
tein). We searched the B. floridae genome for Star family 
members and found six amphioxus sequences similar to 
vertebrate Stard proteins that, based on their phylogenetic 
relationships, were named Bf_Star/Stard3, Bf_Stard7, 
Bf_Stard10, Bf_Dlc1/Stard8/13, Bf_Col4a3bp, and 
Bf_Pctp (fig. 3/C, supplementary fig. S12, Supplementary 
Material online). In our phylogenetic analysis, Bf_Star/ 
Stard3 reliably groups with vertebrate Star/Stard3 proteins 
(fig. 3/C, supplementary fig. S12, Supplementary Material 
online; NJ, 85; ML, 999). This amphioxus sequence is 
therefore a good candidate to mediate the delivery of cho- 
lesterol to the inner mitochondrial membrane of amphi- 
oxus. 

Once in the mitochondrion, synthesis of pregnenolone 
from cholesterol is carried out by Cyp1 1a enzymes, which 
belong to the mitochondrial clan of the cytochrome P450 
enzymes. The mitochondrial Cyp clan includes vertebrate 
Cyp11, Cyp24, and Cyp27 as well as some invertebrate 
Cyp enzymes (Nelson 2009). Our survey of the amphioxus 
genome identified 12 sequences grouping within the mito- 
chondrial Cyp clan (fig. 3L, supplementary fig. S1 3, Supple- 
mentary Material online; NJ, 97; ML, 1,000). The 
phylogenetic analyses do not resolve the evolutionary rela- 
tionships between the amphioxus and vertebrate sequen- 
ces. In total, eight sequences, named Bf_Cyp1 1/24/27_1 
to 8, grouped at the base of the vertebrate mitochondrial 
Cyp enzymes, three sequences, Bf_Cyp24/27_1 to 3, seem 
closely related to vertebrate Cyp24 and Cyp27, and one am- 
phioxus sequence, Bf_Cyp27, stably groups with vertebrate 
Cyp27 (fig. 3L, supplementary fig. S13, Supplementary 
Material online; NJ, 46; ML, 988). Of the amphioxus 
mitochondrial Cyp clan members, six of the eight 
Bf_Cyp1 1/24/27 genes are genomically linked, clustering 
on a single genome scaffold (table 1). 

Given the absence of crystal structures for mitochondrial 
Cyp enzymes, only very limited structural information of 
these proteins is available (Storbeck et al. 2007). We were 
hence not able to carry out meaningful structural compar- 
isons between the amphioxus and vertebrate sequences to 
assist with the inference of evolutionary relationships be- 



tween the different members of this Cyp clan. In conclusion, 
numerous amphioxus mitochondrial Cyp sequences were 
identified, but the physiological contribution of these en- 
zymes to the synthesis of pregnenolone from cholesterol 
remains to be determined. 

Hsd3/3 Enzymes 

Pregnenolone is metabolized to progesterone by Hsd3p 
enzymes. Hsd3p enzymes are unconventional SDR en- 
zymes that belong to the extended category of SDR pro- 
teins (Bray et al. 2009). For the sake of clarity, they were 
not included in the global SDR analysis but have been stud- 
ied separately. The Hsd3p clade includes three human en- 
zymes, namely HSD3p1, HSD3P2, and HSD3P7, which are 
evolutionarily related to Nsdhl (for NAD(P)-dependent 
steroid dehydrogenase-like) and Sdr42e1 (for SDR family 
42E, member 1) enzymes (Bray et al. 2009). We identified 
nine sequences similar to these vertebrate enzymes in the 
amphioxus genome. Phylogenetic analysis shows that 7 
sequences, Bf_Hsd3p_1 to 7, are similar to vertebrate Hsd3p 
(fig. 3M, supplementary fig. S14, Supplementary Material 
online; NJ, 100; ML, 1,000), one sequence, Bf_Nsdhl, groups 
with vertebrate Nsdhl (fig. 3M, supplementary fig. S14, 
Supplementary Material online; NJ, 100; ML, 1,000), and 
one sequence, Bf_Sdr42e1, is associated with vertebrate 
Sdr42e1 (fig. 3M, supplementary fig. S14, Supplementary 
Material online; NJ, 1 00; ML, 1 ,000). Although the seven am- 
phioxus Hsd3p genes do not seem to be linked in the genome 
(table 1), tree topology nonetheless clearly indicates that the 
amphioxus genome contains sequences orthologous to ver- 
tebrate Hsd3p and that the multiple amphioxus Hsd3ps arose 
by linage-specific duplication, independent of the Hsd3p 
diversification of vertebrates. 

Cyp17, Cyp19, and Cyp21 Enzymes 

In steroidogenesis, vertebrate Cyp17 converts progester- 
one to 17oc-OH-progesterone and to androstenedione, 
a precursor of active androgens and estrogens. Cyp17 is 
also capable of transforming pregnenolone to 1 7a-hydrox- 
ypregnenolone and to dehydroepiandrosterone (DHEA). 
Cyp17 belongs to clan 2 of the cytochrome P450 super- 
family, which also includes Cyp1, Cyp2, and Cyp21. We 
found 12 B. floridae sequences belonging to this Cyp clan 
2 (fig. 3L, supplementary fig. S1 3, Supplementary Material 
online; NJ, 95; ML, 1 ,000). Two of these, Bf_Cyp1 7_1 and 
2, are orthologous to vertebrate Cyp1 7, whereas the other 
1 0, Bf_Cyp2_1 to 1 0, group with vertebrate Cyp2. Interest- 
ingly, our phylogenetic analysis suggests that there is no 
amphioxus ortholog of vertebrate Cyp21, which is an 
essential enzyme for the synthesis of corticosteroids in 
vertebrates. 

In vertebrates, androstenedione produced by Cyp17 en- 
zymes is transformed by Cyp1 9, the so-called aromatase, to 
estrone. Cyp19 enzymes have also been involved in the 
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synthesis of estradiol from testosterone. In the B. floridae 
genome, there are two sequences, Bf_Cyp19_1 and 2, that 
are orthologous to vertebrate Cyp1 9 (fig. 3L, supplementary 
fig. S13, Supplementary Material online; NJ, 100; ML, 
1,000). These two amphioxus Cyp19 sequences are most 
likely the result of a lineage-specific duplication. 

Srd5(x Enzymes 

Testosterone can be transformed to 5oc-dihydrotestosterone 
(5ocDHT), a potent sex steroid, by the action of type 1 , type 2, 
and type 3 Srd5ot enzymes (Russell and Wilson 1994; 
Uemura et al. 2008). Srd5ot enzymes are found in inverte- 
brates and vertebrates and are phylogenetically related to 
the Tecr enzymes (for trans-2,3-enoyl-CoA reductase, also 
known as glycoprotein synaptic 2 or Gpsn2) (Markov et al. 
2009). We identified two B. floridae sequences, named 
Bf_Srd5ot1/2_1 and 2, that group within the vertebrate 
Srd5a1/2 clade (fig. 3N, supplementary fig. S1 5, Supplemen- 
tary Material online; NJ, 97; ML, 1,000), one sequence, 
Bf_Srd5oc3, that branches with the vertebrate Srd5oc3 
enzymes (fig. 3N, supplementary fig. S15, Supplementary 
Material online; NJ, 99; ML, 999), and one sequence, Bf_Tecr, 
that clusters with the vertebrate Tecr proteins (fig. 3N, 
supplementary fig. S15, Supplementary Material online; NJ, 
1 00; ML, 1 ,000). These phylogenetic data support the notion 
that the amphioxus genome encodes enzymes with Srd5oc 
activity and that some of these amphioxus genes arose by 
lineage-specific duplication. 

Discussion 

In this study, we have investigated the evolutionary history 
of the chordate retinoid and steroid signaling systems by car- 
rying out an exhaustive analysis of the amphioxus genome. 
Our evolutionary inferences are based on the premises that 
orthologs typically have similar functions (Tatusov et al. 
1997; Eisen 1998; Gabaldon and Huynen 2004) and that 
integrative functional predictions combining phylogenetic 
information, genomic linkage data, concerted gene gains 
or losses, and structural analyses are a powerful tool to pre- 
dict the biological processes an orthologous protein partic- 
ipates in (Gabaldon and Huynen 2004). Using this 
combinatorial approach, we have thus evaluated the pres- 
ence of the genetic machinery for retinoid and steroid sig- 
naling in amphioxus (fig. 4). We will now explore the 
implications of our findings on the evolutionary history of 
retinoid and steroid signaling in chordates, paying special 
attention to the evolution of new physiological capabilities 
in the vertebrate lineage. 

Evolution of Retinoid Signaling 

In the vertebrate RAsignaling cascade, several enzymes, bind- 
ing proteins, and transport elements constitute complex 



machineries that control the physiological levels of RA. Our 
global analysis of the genome of the cephalochordate amphi- 
oxus, the extant species most closely resembling the last in- 
vertebrate ancestor of vertebrates (Schubert, Escriva, et al. 
2006; Holland et al. 2008; Putnam et al. 2008), gives us 
the unique opportunity to attempt the reconstruction of 
the RA pathway in this ancestor and hence to evaluate the 
roles RA signaling might have played in the evolutionary di- 
versification of vertebrates (fig. 4). 

We searched the genome of the amphioxus B. floridae for 
sequences similar to all vertebrate proteins involved in the 
retinoid pathway (fig. 1). Phylogenetic, genomic, and struc- 
tural analyses indicate that amphioxus has 1) SDR-retinol 
dehydrogenases putatively capable to catalyze retinol oxida- 
tion and retinal reduction (Bf_Rdh_1 to 12, Bf_Rdh10, 
Bf_Rdh1 1/1 2_1 to 22, Bf_Rdh13, Bf_Rdh14, Bf_Dhrs3_1 
and 2, and Bf_Dhrs4), 2) enzymes for RA synthesis and deg- 
radation (Bf_Aldh1a_1 to 6, Bf_Aldh8a and Bf_Cyp26_1 to 
3), 3) enzymes likely involved in p-carotene cleavage 
(Bf_Bcmo1/Bco2/Rpe65_1 to 5), and 4) candidates for lipid 
binding proteins possibly involved in retinoid stabilization 
and protection within the cell (Bf_Crbp and Bf_iLbp_1 to 
9). Our screening of the amphioxus genome is consistent 
with the results of previous analyses demonstrating the pres- 
ence in amphioxus of single copies of RA and retinoid X re- 
ceptors (RAR and RXR, respectively) (Holland LZ and Holland 
ND 1 996; Cahestro et al. 2001 ; Escriva et al. 2002; Schubert 
et al. 2004, 2005; Schubert, Holland, et al. 2006; Koop et al. 
201 0) and of active retinoid metabolism (Dalfo et al. 2001 , 
2002). Moreover, the lineage-specific duplications of some 
key components of the retinoid pathway might actually 
reflect very intricate control mechanisms for retinoid 
metabolism throughout the amphioxus life cycle. 

In contrast, our searches for Lrat, Rbp4, Ttr, and Stra6 
suggest that amphioxus might lack counterparts of these 
components of vertebrate retinoid signaling. The amphioxus 
genome contains sequences related to these proteins, but 
structural analyses indicate that the amphioxus representa- 
tives of these families more closely resemble vertebrate pro- 
teins with functions that are not directly linked to the 
retinoid pathway. This concerted absence of the four main 
components involved in storage, transport, and cellular 
uptake of retinoids suggests that this system might be 
a functional novelty of vertebrates. Elaboration of this axis 
of retinoid metabolism might have enabled vertebrates to 
improve the control of retinoid homeostasis and to 
compensate retinoid fluctuations in natural environments. 

Evolution of Steroid Signaling 

Comparative analyses of steroidogenic enzymes has led to 
the conclusion that some enzymatic activities arose inde- 
pendently in arthropods and vertebrates and, therefore, 
that steroidogenesis has been elaborated in parallel in the 
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Fig. 4. — The evolutionary diversification of the retinoid and steroid genetic machineries in chordates. Gene duplications and functional 
innovations are shown for both the cephalochordate and the vertebrate lineages. The names of duplicated genes and gene families are indicated for 
both cephalochordates and vertebrates and several functional innovations resulting from the evolution of novel protein functions are proposed. The "?" 
highlights that additional studies assessing the biochemical and functional properties of the cephalochordate retinoid and steroid machineries are 
required to provide experimental support for the hypotheses derived from our evolutionary analyses. 



main bilaterian lineages (Markov et al. 2009). The origin of 
the enzymatic activities within the chordate phylum, how- 
ever, remains uncertain, and the survey of steroidogenic en- 
zymes (fig. 1) in amphioxus is crucial to obtain insights into 
the evolution of the steroid pathway in chordates and ver- 
tebrates (fig. 4). 

Our analysis reveals that, with some minor and very in- 
triguing exceptions, the amphioxus genome contains most 
of the genetic machinery for steroid metabolism. The rep- 
ertoire of steroidogenic enzymes is even significantly larger 
than previously anticipated (Baker 2004a). Amphioxus has 
orthologs of vertebrate Cyp17 (Bf_Cyp17_1 and 2), 
Cyp19 (Bf_Cyp19_1 and 2), Star (Bf_Star/Stard3), Hsd3p 
(Bf_Hsd3p_1 to 7), Srd5oc (Bf_Srd5oc1/2_1 and 2 and 
Bf_Srd5oc3) and of mitochondrial Cyp enzymes evolution- 
ary related to vertebrate Cyp11a (Bf_Cyp1 1/24/27J to 
8). Although Cyp19 and Hsd3p activities have been de- 
tected in amphioxus gonadal tissues (Callard et al. 1984; 
Mizuta et al. 2008), and although docking studies predict, 
for example, that Cyp1 9 can bind androgens (Callard et al. 



201 1), the biochemical activity of most of these amphioxus 
enzymes still requires experimental confirmation. Neverthe- 
less, based on this collection of amphioxus enzymes and 
supported by the presence of sex steroids in amphioxus 
(Chang et al. 1985; Fang et al. 2001; Takeda et al. 2003; 
Mizuta and Kubokawa 2007), we hypothesize that a verte- 
brate-like sex steroid metabolism exists in cephalochordates. 
In contrast, amphioxus probably lacks Cyp21, which in ver- 
tebrates is an enzyme instrumental for corticosteroid syn- 
thesis. The absence of Cyp21 and the fact that C21 
hydroxylated steroids could not be detected after incubation 
of amphioxus tissues with pregnenolone (Mizuta et al. 
2008) together suggest that cephalochordates might not 
be capable of synthesizing corticosteroids and therefore that 
adrenal steroid hormones were a vertebrate innovation. 

In amphioxus, sex steroids, including progesterone, es- 
trone, estradiol and testosterone, have been detected by 
RIA and immunohistochemical methods (Chang et al. 
1985; Fang et al. 2001; Takeda et al. 2003; Mizuta and 
Kubokawa 2007), and two steroid receptors, ER and SR, 
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have been characterized (Bridgham et al. 2008; Paris et al. 
2008; Katsu et al. 201 0; Callard et al. 201 1). Amphioxus ER 
is orthologous to vertebrate estrogen receptor (ER) and am- 
phioxus SR is orthologous to the vertebrate steroid receptors 
(SR), including androgen receptor (AR), glucocorticoid re- 
ceptor (GR), mineralocorticoid receptor, (MR) and progester- 
one receptor (PR) (Bridgham et al. 2008; Paris et al. 2008; 
Schubert etal. 2008; Katsu etal. 2010; Callard etal. 2011). 
Although amphioxus ER does not activate transcription and 
seems to be an estrogen insensitive inhibitor of SR activity 
(Bridgham et al. 2008; Katsu et al. 2010; Callard et al. 
201 1 ), amphioxus SR is activated by estrogens and mediates 
transcriptional activation through ER- and AR-like DNA re- 
sponse elements (Bridgham et al. 2008; Katsu et al. 2010). 
Although the binding of amphioxus ER and SR to other sex 
steroids, such as progesterone or testosterone, has not been 
demonstrated, 3D modeling of amphioxus SR suggests that 
it may also bind androgen steroids (Baker and Chang 2009). 
In sum, given the relative complexity of the amphioxus ste- 
roidogenic system due to various lineage-specific duplica- 
tions, the study of the enzymatic activities of the 
different steroidogenic enzymes in amphioxus, thecomplete 
analysis of the binding capacities of the two amphioxus ste- 
roid receptors, and the detailed assessment of 
the physiological roles of steroids in amphioxus are crucial 
for understanding the elaboration of steroid-dependent 
signaling not only in cephalochordates but also in vertebrates. 

Evolution of Prereceptor Regulation 

In the steroid pathway, hydroxysteroid dehydrogenases reg- 
ulate the steroid response in vertebrates by controlling hor- 
mone activation and inactivation, a mechanism known 
as prereceptor regulation of steroid action (reviewed in 
Penning 2003). Vertebrate Hsd1 7(3 control sex steroids, 
whereas Hsd11(3 regulate adrenal hormones. Amphioxus 
has many Hsd17[3 enzymes: one amphioxus Hsd1 1/17(32, 
six Hsd17p8, one Hsd1 7(310, and two Hsd17p11/13 are 
candidates to catalyze oxidation of sex steroids, whereas 
one Hsd17p7, three Hsd17pi2, and one Hsd17pi4 might 
be responsible for steroid reduction. Amphioxus, therefore, 
has the genetic machinery necessary for enzymatically acti- 
vating and inactivating sex steroids, pushing back the evolu- 
tionary origins for prereceptor control of sex steroid responses 
to the origin of chordates. In contrast, given the likely absence 
of the corticosteroidogenic Cyp21 enzyme in amphioxus for 
synthesis of adrenal steroids, a prereceptor control for corti- 
costeroid action is probably not required in amphioxus. Even 
though the amphioxus genome contains 23 Hsd1 1 pi/1 l-like 
and 1 Hsd1 1/17p2-like sequences, evolutionary and func- 
tional studies are compatible with these enzymes metaboliz- 
ing steroids other than corticosteroids (Kusakabe et al. 2003; 
Baker 2004b, 2010b). The evolution of Hsd11p1 and 
Hsd1ip2 activities against corticosteroids would thus be 



functional innovations of vertebrates that allowed the elab- 
oration of a prereceptor control of the adrenal hormone re- 
sponse in the vertebrate lineage. 

In vertebrates, retinoid action is tightly controlled spatio- 
temporally by the coordinated activity of enzymes synthesiz- 
ing (i.e., Aldhla and Aldh8a) and degrading (i.e., Cyp26) 
endogenous RA (Niederreither et al. 2002; Reijntjes et al. 
2005; Campo-Paysaa et al. 2008; Theodosiou et al. 
2010). This metabolic system controlling the physiological 
levels of retinoids available for receptor occupancy is remi- 
niscent of the prereceptor system for steroid action. Our 
phylogenetic data indicate that this prereceptor regulation 
is also functioning in amphioxus and was hence already 
present in the last common ancestor of amphioxus and ver- 
tebrates. Moreover, because in amphioxus the components 
of retinoid prereceptor regulation have been subjected to 
lineage-specific duplications (there are six Aldhla, one Ald- 
h8a, and three Cyp26 enzymes encoded in the amphioxus 
genome), it is possible that this system has secondarily been 
elaborated in this lineage, creating a rather complex meta- 
bolic regulation of retinoid signaling, which contrasts the 
simplicity of the retinoid receptor system, which in cepha- 
lochordates is based on a single RAR and a unique RXR. 

Supplementary Material 

Supplementary figures S1-S15 are available at Genome 
Biology and Evolution online (http://www.gbe. oxfordjournals. 
org/). 
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